New Experiments in Distributional Representations of Synonymy
Abstract
Recent work on the problem of detecting synonymy through corpus analysis has used the Test of English as a Foreign Language (TOEFL) as a benchmark. However, this test involves as few as 80 questions, raising doubts about the statistical significance of reported results. We overcome this limitation by generating a TOEFL-like test using WordNet, containing thousands of questions and composed only of words occurring with sufficient corpus frequency to support sound distributional comparisons. Experiments with this test lead us to a similarity measure which significantly outperforms the best proposed to date. Analysis suggests that a strength of this measure is its relative robustness against polysemy.
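The concern about test size can be made concrete with a rough calculation. The sketch below is not taken from the paper: it uses the standard normal approximation for a binomial proportion, and both the accuracy of 0.80 and the question count of 8000 are illustrative assumptions (the abstract says only "thousands of questions").

```python
import math


def accuracy_margin(accuracy: float, n_questions: int, z: float = 1.96) -> float:
    """Half-width of the normal-approximation interval for an accuracy score.

    z = 1.96 corresponds to a 95% confidence level.
    """
    return z * math.sqrt(accuracy * (1.0 - accuracy) / n_questions)


# Hypothetical comparison: the 80-question TOEFL set versus an assumed
# 8000-question generated test, both scored at an assumed 80% accuracy.
for n in (80, 8000):
    print(n, round(accuracy_margin(0.80, n), 4))
```

Under these assumptions the margin is close to nine percentage points for 80 questions and under one point for 8000, so two systems a few points apart cannot be reliably separated on the smaller test.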
Similar resources
A Revision of the Euphorbia Dioscoreoides Complex (Euphorbiaceae)
A revision of the Euphorbia dioscoreoides complex (subgenus Agaloma) is provided. Euphorbia dioscoreoides ssp. attenuata and E. eglandulosa, both from Mexico, are proposed as new; E. digitata is reduced to synonymy under E. subpeltata. Representative specimens are cited, and distributional and ecological data are provided.
A checklist of stag beetles (Coleoptera: Scarabaeoidea: Lucanidae) from Iran
An updated checklist of the Lucanidae (Coleoptera) from Iran is given. New locality records are listed and some dubious distributional records are discussed. Dorcus vavrai Nonfried, 1905 is placed in synonymy with Dorcus peyronis Reiche and Saulcy, 1856 (new synonymy). The female of Lucanus xerxes Král, 2004 is described. A key for the identification of the Iranian stag beetle species is also pr...
A Computational Holographic Model of Memory for Abstract Associations
How do humans learn the syntax and semantics of words from language experience? How does the mind discover abstract relationships between concepts? Computational models of distributional semantics can analyze a corpus to derive representations of word meanings in terms of each word’s relationship to all other words in the corpus. While these models a...
Bridging the distributional gap of Tylorida striata (Thorell, 1877) and new synonymy (Araneae: Tetragnathidae)
BACKGROUND: Although Tylorida striata has not been reported from India, observations on India Biodiversity Portal (IBP 2015), an open access repository for biodiversity information of the Indian subcontinent, showed images resembling this species. The respective locality in Gujarat, India was explored and specimens were studied to confirm the record of T. striata in India. Literature study showed some tax...
New species and records of Charisius Champion from Mexico and Central America (Coleoptera, Tenebrionidae, Alleculinae)
The species of the genus Charisius Champion, from Mexico and Central America are reviewed. The flightless genus Narses Champion, with one included species, N. subalatus Champion, is placed in synonymy with the genus Charisius. Four new species are described and illustrated, C. granulatus and C. punctatus (from Guatemala) and C. apterus and C. howdenorum (from Mexico). Charisius subalatus (Champ...
Publication date: 2005